Data Sources

Three sources of data are collected in this study, including stream quantity and quality data from United States Geological Survey (USGS), climate information from National Oceanic and Atmospheric Administration (NOAA), and stock price data from Yahoo finance.

Stream quantity and quality

The daily summary of stream quantity and quality are obtained from USGS at USGS 04193500 Maumee River at Waterville OH station (Figure 1). Figure 2 shows the download page. In this study, streamflow discharge from 1940 to 2022 was downloaded as shown in Figure 3.

Figure 1: USGS gaging station location

Figure 2: USGS data download page

Figure 3: USGS stream quantity and quality data example

Climate data

The daily summary of climate data, including precipitation, maximum temperature, and minimum temperature, are obtained from NOAA at Hydrologic Unit Code (HUC) 04100009 watershed where the water drained from to the 04193500 gaging station. Figure 4 shows the download page. In this study, climate information from 1940 to 2022 was downloaded as shown in Table 1.

Figure 4: NOAA data download page

Code
library(knitr)
climate <- read.csv("./data/climate1.csv",header=TRUE)
climate <- drop_na(climate)
kable(head(climate))
Table 1: Cliamte data example
STATION NAME DATE PRCP TMAX TMIN
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-01 0.26 47 28
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-02 0.00 47 34
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-03 0.04 50 31
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-04 0.00 46 30
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-05 0.92 55 33
USW00094830 TOLEDO EXPRESS AIRPORT, OH US 1955-01-06 0.08 37 30

Stock price

The stock price is from Yahoo finance using getSymbols() function of quantmod package in R. The data includes open price, highest price, lowest price, closing price, volume, and adjusted closing price.